Arizona State University



Teachers’ Views on High-stakes Testing: 
Implications for the Classroom 
Policy Brief 
by 

Lisa M. Abrams 
Boston College 



Education Policy Research Unit (EPRU) 

Education Policy Studies Laboratory 
College of Education 

Division of Educational Leadership and Policy Studies 
Box 872411 

Arizona State University 
Tempe, AZ 85287-2411 



February 2004 






EPSL-0401-104-EPRU 

http://edpolicylab.org



Education Policy Studies Laboratory 

Division of Educational Leadership and Policy Studies 
College of Education, Arizona State University 
P.O. Box 872411, Tempe, AZ 85287-2411
Telephone: (480) 965-1886 
Fax: (480) 965-0303 
E-mail: epsl@asu.edu 
http://edpolicylab.org 







Executive Summary 



There is an appealing logic associated with current models of test-based 
accountability: the interplay among content standards, state tests, and accountability is a 
powerful tool to improve the quality of schools. However, when high-stakes 
consequences are attached to test results for schools, teachers, and students, unexpected 
consequences may outweigh the intended benefits. To explore the policy impact of 
Florida’s state testing and accountability program on classroom practices, teachers, and 
students as perceived by educators, this brief presents the results of a national survey in 
which the responses of Florida teachers are compared with those of practitioners in other 
states using high-stakes exams. The findings reveal that, compared to their counterparts 
in other high-stakes states, teachers in Florida perceived a more pronounced impact of the 
state test. 

Recommendations 

1. Florida should undertake a long-term evaluation and monitoring program to
assess the impact of the Florida Comprehensive Assessment Test (FCAT) and 
the A+ Accountability program. This evaluation and monitoring program 
should be conducted by an external organization or research institution. Its 
purpose is to determine if the state testing program is achieving its intended 
goals. The evaluation should also examine the unexpected consequences of 
the FCAT and A+ Accountability program on the educational process and on 
key stakeholders. 

2. Florida testing policy should adhere to the recognized professional standards
regarding test development and to the appropriate use of test results as
described in the Standards for Educational and Psychological Testing,
published jointly by the American Educational Research Association, the
American Psychological Association, and the National Council on
Measurement in Education.

3. Florida policy makers should not make highly consequential decisions about 
students (such as deciding whether a student is promoted to the next grade or 
is awarded a high school diploma) by means of test scores alone. Given the 
evidence pointing to weaknesses in the testing system, it is important to use 
other sources of information in conjunction with state tests.

Teachers’ Views on High-stakes Testing:
Implications for Classroom Practice and Student Learning 1

Section 1: The Issue 

Accountability-based testing policies have evoked heated debate, especially as
states realize full implementation of their education reform policies. The 2002-2003
school year marked the first time that students’ scores on the Florida Comprehensive
Assessment Test (FCAT) factored into decisions about grade promotion and graduation.
Headlines in Florida newspapers were rife with accolades, reports of failure, threats of
legal action, calls for reform, and organized protests of the state’s education
accountability system: “Students Feel Sting of Reform,” “Critics of Graduation Exam
Threaten Boycott in Florida,” “1,300 Third-Graders in Palm Beach County Held Back by
FCAT Scores,” and “Governor Jeb Bush Announces Biggest Improvement Ever on
FCAT” are just a few examples. The debate over how to improve schools, education
quality, and student academic performance is one that many policymakers, legislators,
administrators, teachers, students, parents, and advocacy organizations are grappling
with, not just in Florida but across the nation.

A growing body of evidence suggests that high-stakes testing can be a driving
force behind fundamental change within schools. There is a difference of opinion as to
whether this change is improving the quality of education, however. For example,
whereas some contend that the guarantee of rewards, or the threat of sanctions, is
essential to promote quality teaching and to encourage higher standards of achievement,
others maintain that high-stakes tests limit the scope of classroom instruction and student
learning in undesirable ways. Regardless of one’s position on this issue, it is impossible
to deny that statewide testing policies influence classroom instruction and student
learning. This brief explores how public school teachers in Florida perceive the effects of
the state test and accountability system on their instruction and their students.

Section 2: Background 

Current Testing Landscape 

Current testing policies grew out of standards-based reform begun in the early 
1990s. This initiative called for a rigorous and demanding curriculum that, in addition to 
requiring students to demonstrate their command of basic content knowledge, also asked 
them to exhibit higher-level cognitive processes (e.g., application, problem solving, 
inquiry). The shift from basic skills to high standards has given rise to the current
state-level accountability systems designed to hold schools, administrators, teachers (and
sometimes students) responsible for meeting these raised expectations. These systems
have four main components: content standards that communicate the desired content
knowledge and skills; tests that measure progress toward achieving the content standards;
performance targets that identify criteria used to determine whether schools, students, or
both have reached the desired level of achievement; and incentives, in the form of
consequences (rewards and sanctions), or stakes, that reinforce the attainment of
performance targets. 4

Over the last decade, test-based accountability systems have become widespread; 
every state except Iowa has instituted curricular standards or frameworks. 5 Moreover, 
every state uses a test to measure the degree to which students have mastered the 
knowledge and skills expressed in these standards. 6 While on the surface it might appear 
that state testing policies are becoming increasingly similar, there are substantial
differences in test content, item format, and how test results are used, especially for
accountability purposes. 7

The focus on state assessment requirements was further emphasized in the 2001
reauthorization of the Elementary and Secondary Education Act (ESEA), known more
commonly as the No Child Left Behind Act (NCLB). This far-reaching legislation seeks
to raise the level of achievement for all students and to reduce the gap in the level of
performance of students from different backgrounds. At the heart of NCLB are the
assessment and accountability requirements, which will substantially increase the extent
to which students are tested. 8 The law includes school accountability provisions;
however, states still retain the authority to determine how, or if, students will be held
responsible for test performance.

The implementation of NCLB has expanded a majority of current state 
assessment programs, requiring testing at more grade levels. The federal law requires 
that states annually administer reading and math tests to all students in Grades 3-8, and in 
one year between Grades 9-12 starting in 2005-2006. 9 This requirement affects at least 
25 million students annually. 10 Arguably, NCLB is one of the most aggressive federal 
efforts to improve elementary and secondary education and marks a major departure from 
the traditionally noninterventionist role of the national government in forming state 
education policy. 

Education Reform in Florida 

The state testing and accountability program in Florida has met with controversy
since the early 1970s. Originally, the state test targeted basic skills. Although this type
of minimal competency testing was common, Florida was the first state in the nation to tie
students’ test results to graduation, a policy that was broadly criticized, with its
legitimacy ultimately decided in the federal courts. 11 Throughout the 1980s, education
policy at both the federal and state levels was moving away from emphasizing basic
skills and toward requiring students to meet higher or “world class” content
standards, in an attempt to ensure that the United States continued to have a competitive
edge in the growing world market. State testing policies in Florida reflected this shift.

In 1995, the Florida Commission on Education Reform and Accountability called
for the creation of rigorous state curriculum frameworks and recommended changes in
the state assessment program. The FCAT, designed to measure the extent to which
students had achieved the Sunshine State Standards in reading, writing and math, was
first administered in 1998 (in Grades 4, 5, 8, and 10) and became a high school
graduation requirement for the class of 2003. Currently, 21 other states use test results to
make similar high-stakes decisions for students; by 2008, 24 states will base graduation
and grade promotion on state test results. 12

In 1999, as part of Governor Bush’s A+ Accountability Plan, the state assessment 
program was expanded to include more students at more grade levels. Currently, 
students in Grades 3-10 are tested annually in reading and math. Writing tests are 
administered in Grades 4, 8, and 10, and science exams in Grades 5, 8, and 10. Since 
1999, student performance on the FCAT has been used, in part, to assign schools grades 
(A-F) which are used to monitor performance, identify unsuccessful schools, and 
leverage rewards and sanctions. The A+ Accountability Plan also includes a major
reform of the accountability system: it provides “opportunity scholarships,” or vouchers,
to students in consistently low-performing (F) schools, allowing them to transfer to
alternative higher-performing public or private schools. 14 Schools can move out of the F
category by improving FCAT scores. Recent NCLB legislation contains provisions that
require states to implement similar “choice” options, such as opportunity scholarships, 
for students in schools that consistently do not meet adequate yearly progress goals. In 
many ways, how teachers respond to the requirements of the FCAT and the A+ 
Accountability Plan may foreshadow reactions in other states. 

The Effects of High-Stakes Testing 

In order to examine how Florida’s testing and accountability policies are affecting
classroom instruction and student learning, it is first necessary to summarize the findings
of research studies in this area, thus providing a frame of reference. Numerous studies
have investigated the effects of state-mandated testing programs, particularly those with
high stakes attached to test results. 15 The majority have addressed effects on teaching
and learning, specifically: the content of instruction; the strategies used to deliver
instruction; the impact of the format of the state test on classroom practices; test
preparation; the psychological impacts of the test on both teachers and students (e.g.,
pressure, morale, and motivation); and student learning in general.

Impact on Classroom Practices 

Much of the research addresses the effects of state testing programs on what is
taught. A common finding is that teachers report giving greater attention to content areas
on which students will be tested. For example, of the 722 Virginia teachers surveyed
when the state test was first implemented, more than 80 percent indicated that the
Standards of Learning (SOL) test had affected their instruction, especially the content
focus of daily lessons. 16 Increased attention to tested content would be expected to
reduce the time spent on other areas of the curriculum. In Kentucky, 87
percent of teachers agreed that their state test, the Kentucky Instructional Results
Information System (KIRIS), had “caused some teachers to de-emphasize or neglect
untested subject areas.” 17 Results from a national survey of 4,200 teachers confirm these
state-level findings: 76 percent of the responding teachers indicated that they had
increased the amount of time they spend on tested content areas, while more than half (52
percent) indicated they had decreased the amount of class time devoted to content areas
not covered by the state test. 18

The impact of the state test on instructional strategies is mixed. Studies in states
that require students to formulate and to provide written responses to test questions show
an increased emphasis on writing and higher-level thinking skills. 19 For example, in
Kentucky, 80 percent of the fourth and eighth grade mathematics teachers increased
instructional emphasis on problem solving and writing as a result of the portfolio-based
state test. 20 In contrast, teachers decreased the use of more time-consuming instructional
strategies and lengthy enrichment activities. 21 A more recent study found that the
format of the state test may adversely affect the use of technology for instructional
purposes. For instance, one-third of teachers in high-stakes states were less likely to use
computers to teach writing because students were required to construct handwritten
responses on the state test. 22

Emphasis on Test Preparation 

The pressure to respond to increased demands of the state test often requires
teachers to place more emphasis on test preparation. In Maryland, 88 percent of teachers
surveyed felt they were under “undue pressure” to improve student performance on the
state test. 23 When asked the same question, 98 percent of Kentucky teachers responded
similarly. 24 Ninety percent of teachers surveyed nationally reported feeling pressure from
their district superintendent to raise test scores, while 79 percent indicated feeling
pressured by their building principal to improve student performance.

In light of this pressure, teachers may put greater emphasis on preparing students 
for the state test. Of the 470 elementary teachers surveyed in North Carolina, 80 percent 
reported that “they spent more than 20 percent of their total instructional time practicing 
for the end-of-grade tests.” Similarly, a survey of reading teachers in Texas revealed
that, on average, teachers spent 8 to 10 hours per week preparing students for the Texas 
Assessment of Academic Skills (TAAS). 

Overemphasis on specific test preparation activities has given rise to concerns 
about the validity of test scores as accurate measures of students’ levels of achievement. 
Specific preparation activities such as coaching, teaching test-taking skills, and 
instruction geared toward the test can yield invalid test results. One would expect that 
if students’ scores improve on the state test (from year to year), scores on other tests that 
measure the same content, skills, or both, should show similar improvement. When 
trends in student performance levels on similar standardized tests are not consistent, the
accuracy of a particular test as an indicator of student achievement is questionable. For 
example, 40 percent of teachers surveyed nationally reported that they had found ways to 
raise state test scores without really improving learning. Similarly, 50 percent of 
responding Texas teachers did not think that the rise in TAAS scores “reflected increased 
learning and high quality teaching.” Based on written comments, the study authors
concluded that “teachers regarded improvement on the TAAS as a direct result of
teaching to the test.” 27

Student performance on a highly consequential test may not generalize to other
measures of achievement. Several studies have compared students’ performance on the
state test with performance on other standardized tests that assess similar information.
Researchers found that gains on the KIRIS math test were substantially larger than
improvements for Kentucky students on the math portion of the National Assessment of
Educational Progress (NAEP). This suggests that improved performance on the KIRIS
math test does not necessarily reflect broader gains in student knowledge. 28 Further,
when student performance on several different standardized tests in 18 states with
high-stakes testing programs was systematically compared, the findings were inconclusive:
there was not a strong link between the implementation of the state testing program and
improvement in student achievement. 29

Impact on Motivation and Morale 

High-stakes tests may motivate certain teachers and some students to achieve 
optimal performance levels. However, researchers have cautioned that placing a 
premium on student test performance can lead to instruction that is reduced primarily to 
test preparation, thus limiting the range of educational experiences for students and 
constraining the pedagogical skills of teachers. Studies have also shown that
high-stakes assessments increase stress and decrease morale among teachers. More than 77
percent of North Carolina teachers surveyed indicated decreases in their morale; 76
percent reported that teaching was more stressful since the implementation of the North
Carolina state testing program. 31 In Texas, 85 percent of teachers surveyed agreed with
the statement “some of the best teachers are leaving the field because of the TAAS.” 32

Increased levels of anxiety, stress, and fatigue are frequently reported effects on
students that may inhibit optimal performance on the state exam. In a national teacher
survey, 75 percent reported that students were under intense pressure to perform well on
their state test; 76 percent indicated that they perceived students to be extremely anxious
about taking the exam. 33 One third of Kentucky teachers reported that student morale had
declined in response to the KIRIS. 34 Other research, conducted in Chicago with 102
low-achieving 6th and 8th grade students, illustrates the positive impact the test can have
on student motivation. The results suggest that high-stakes testing encouraged a majority
of students to work harder and in many cases led to score improvements. 35

Accountability and Meaning of Test Results 
Not only do the results of state tests provide information about the progress of 
individual students, they are often aggregated to evaluate the performances of both 
schools and districts. In 2002, 17 states offered schools rewards for high or improved test 
scores; at least 19 attached sanctions for schools consistently exhibiting poor student 
performances on the state test; and 8 permitted students to transfer out of low-performing
schools. In addition to losing accreditation if students perform poorly on the state test, 
schools may also lose funding and face the possibility of closure or reconstitution. 

Several studies have examined teachers’ views on accountability and found both 
positive and negative results. In North Carolina, 76 percent of teachers reported that they 
“believed that the accountability program would not improve the quality of education in 
their state,” as compared with a majority of teachers surveyed in Kentucky who held
positive perceptions of the impact of the state test on instruction. 38 Research conducted
in Maine and Maryland suggests that teachers’ perceptions within the same state were not
always consistent. In other words, the intended effects of the rewards and sanctions tied
to test performance may be influenced by other factors specific to schools and districts,
such as the availability of resources and professional development opportunities. As a 
result, state-testing policies may produce inconsistent and varied effects across schools 
and districts. 39 

Research on the effects of state-mandated testing programs, as perceived by 
teachers, has revealed mixed effects: testing policies have both positive and negative 
impacts on instruction and learning, as well as on the teachers and students themselves. 
Therefore, it is necessary to determine if the benefits outweigh the potentially negative 
results, which, although unintended, may distort the educational process and are 
associated with great human costs. In order to address these complex issues, it is 
necessary to determine whether state testing policies are having the intended effect — not 
just by relying on information provided by test scores. A critical look at changes in 
schooling, as well as in teacher and student behaviors, is crucial. It is necessary to 
include those closest to the educational process in this endeavor. 

Section 3: Data 

In an effort to determine how state testing programs are affecting instruction and 
learning, the National Board on Educational Testing and Public Policy 40 sought the 
opinions of classroom teachers. The board’s 2001 survey of teachers in 47 states 
included questions about the following: time spent on instruction in tested and non-tested 
areas; alignment of state standards and tests with teachers’ instruction and classroom
assessments; pressure to improve test scores; school climate and morale; teachers’ views
on accountability; and their general opinions about state testing programs. 41 From a 
sample of 12,000 public school elementary and secondary teachers, about 4,200 
responded to the mail survey (35 percent). Of the responding teachers, 167 were from 
Florida. 
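
The response figures above reduce to simple arithmetic. As a quick check, the minimal sketch below recomputes the response rate and Florida's share of respondents from the counts reported in this section. The brief gives the respondent total as "about 4,200," so the results are approximate, and the function and variable names are illustrative assumptions rather than anything taken from the survey documentation.

```python
def percent_of(part: int, whole: int) -> float:
    """Return `part` as a percentage of `whole`."""
    if whole <= 0:
        raise ValueError("whole must be positive")
    return 100.0 * part / whole


# Counts as reported in this section (the respondent total is approximate).
SAMPLED_TEACHERS = 12_000
RESPONDING_TEACHERS = 4_200
FLORIDA_RESPONDENTS = 167

overall_rate = percent_of(RESPONDING_TEACHERS, SAMPLED_TEACHERS)
florida_share = percent_of(FLORIDA_RESPONDENTS, RESPONDING_TEACHERS)

print(f"Overall response rate: {overall_rate:.0f} percent")
print(f"Florida share of respondents: {florida_share:.1f} percent")
```

Florida teachers make up only a small fraction of all respondents, which is worth keeping in mind when reading the Florida percentages reported in the comparisons that follow.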

This section compares the responses of Florida teachers with those of teachers in 
other states using high-stakes testing 42 — a useful comparison considering the federal 
push toward accountability and the increasing number of states using test results for 
grade promotion or graduation decisions. High-stakes policies affect the majority of the 
nation’s public school teachers and students. The states using high-stakes testing are 
generally the most populous; by 2008, state exit exams will affect 7 out of 10 public 
school students. 43 These comparisons provide insight into the similarity between the 
perceptions of teachers in Florida and other states of the effects of their states’ testing and 
accountability policies. As shown in Table 1, Florida teachers were slightly more diverse
with regard to race/ethnicity and teaching experience than were teachers in other
high-stakes states, but otherwise these are relatively similar groups.

Table 1: Profile of Responding Teachers from Florida and Other High-stakes States 44

Respondent Characteristics        Percent in     Percent in Other
                                  Florida        High-stakes States

Gender
  Female                          81             83
  Male                            19             17

Age
  Over 40                         59             67

Race/Ethnicity
  African American                11             9
  Hispanic                        13             6
  White                           77             84

Years of teaching experience
  1-12                            50             38
  13-20                           9              24
  Over 20                         41             38

School type
  Elementary                      53             59
  Middle                          25             20
  High                            22             21

Source: Pedulla, J., Abrams, L., Madaus, G., Russell, M., Ramos, M., & Miao, J. (2003). Perceived
effects of state-mandated testing programs on teaching and learning: Findings from a national survey of
teachers. Chestnut Hill, MA: National Board on Educational Testing and Public Policy, Boston College.



The survey results are organized into several broad areas: general views on the 
state standards and tests; impact on classroom practices; pressure on teachers and test 
preparation; impact on motivation and morale; views on the use of tests for purposes of 
accountability; and the meaning of test scores. Tables 2 and 3 present a summary of the 
results. 

Views on State Standards and the State Test 
Compared to teachers in other high-stakes states, Florida teachers held more
positive views on state standards and the compatibility of the state test with their
classroom instruction. Roughly seven out of every ten Florida teachers (71 percent)
reported that the FCAT is based on a curriculum framework that all teachers should
follow; a significantly smaller percentage of teachers in similar settings held this view
(59 percent). Moreover, 79 percent of teachers in Florida, compared with 64 percent in
other high-stakes states, found the state test compatible with their daily classroom
practices. A greater percentage of Florida teachers than of teachers in states with similar
types of testing programs also found the state test to have had a positive impact on the
political agenda by bringing much needed attention to education issues (58 percent v. 42
percent). On two other items, the differences between the groups were not statistically
significant (Table 2): 57 percent of Florida teachers and 54 percent of teachers in other
high-stakes states reported that if they taught to the curriculum frameworks or standards,
students would be successful, and 54 percent and 48 percent, respectively, reported that
the state test measures high standards of achievement.

Impact on Classroom Practices 

Survey results as to the impact of the state test on classroom practices were 
consistent with research findings in this area. Teachers in Florida reported in larger 
percentages than did teachers in other high-stakes states that they had greatly increased 
time spent on tested material (62 percent v. 42 percent). Not surprisingly, a majority of 
teachers from both groups reported that they had decreased instructional time devoted to 
non-tested content (67 percent v. 58 percent). Florida teachers were almost twice as
likely as teachers in states with similar testing programs to report greatly decreasing
the time spent on classroom enrichment activities, although the percentages were fairly
small: 23 percent compared with 12 percent, respectively.

With regard to the use of technology, a significantly larger percentage of Florida
teachers than of teachers in other high-stakes states (42 percent v. 32 percent) reported
that they did not use computers when teaching writing because the state writing test
requires students to provide handwritten responses. A consideration: tests that require
handwritten responses underestimate the abilities of students who are accustomed to
writing with computers. 45

The overwhelming majority of teachers reported that the state testing program has 
led them to teach in ways contrary to their own ideas of sound educational practices. 
Florida teachers, though, were significantly more likely to hold this view than were 
teachers in other states with high-stakes policies (90 percent v. 75 percent). Survey 
results suggest that teachers in high-stakes settings gear the content of instruction to that 
of the state test, and that they are also modeling their own classroom assessments based 
on the test’s format. Florida teachers reported these changes in greater percentages than 
did teachers in other high-stakes states. Furthermore, a large percentage of educators in
Florida perceive that the instructional changes required to meet the demands of the state
testing program do not translate into good practice.

Pressure on Teachers 

Compared with teachers in other high-stakes settings, those in Florida were more
likely to feel pressure to raise test scores, especially from sources external to their school.
Florida teachers were far more likely to strongly agree that they felt test-related pressure
from the district superintendent than from their building principal (80 percent v. 48
percent), and they reported superintendent pressure in significantly greater percentages
than did teachers in other high-stakes states (80 percent v. 56 percent). Florida teachers
also were significantly more likely than their counterparts in other high-stakes settings to
strongly agree that the pressure for high scores on the state test was so intense that they
had little time to teach anything not on the test (63 percent v. 40 percent). Florida teachers
also devoted more time to test preparation: 54 percent reported spending more than 30
hours per year preparing students specifically for the state test, as compared with
43 percent of the educators in other high-stakes settings. The ways in which teachers in
both Florida and other high-stakes states prepared students were fairly similar, however.
Commercially available or state-developed materials and released items were
most common among both groups when getting students ready for the state test, although
teachers in Florida were more likely to use prepared resources (75 percent v. 63 percent).
Overall, when compared with teachers from other states that rely on high-stakes testing,
Florida educators felt pressure to raise students’ test scores in significantly greater
percentages, and more of them spent upwards of 30 hours per year on test preparation
toward this effort.
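
Table 2 flags several of the differences discussed here as significant at alpha = .001, but this brief does not say which statistical test produced those results. One common way to compare two independent survey percentages is a pooled two-proportion z-test, sketched below for the superintendent-pressure item (80 percent v. 56 percent). The Florida sample size of 167 comes from Section 3. The size of the comparison group is not reported in this brief, so the value of 2,000 used here is purely hypothetical, as are the function and variable names, and item-level nonresponse is ignored.

```python
import math


def two_proportion_z_test(p1: float, n1: int, p2: float, n2: int) -> tuple[float, float]:
    """Pooled two-proportion z-test; returns (z statistic, two-sided p-value)."""
    # Pooled proportion under the null hypothesis that both groups share one rate.
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    std_err = math.sqrt(pooled * (1.0 - pooled) * (1.0 / n1 + 1.0 / n2))
    z = (p1 - p2) / std_err
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_value


FLORIDA_N = 167          # Florida respondents, as reported in Section 3
OTHER_N_ASSUMED = 2_000  # HYPOTHETICAL: comparison-group size is not given in the brief

z_stat, p_val = two_proportion_z_test(0.80, FLORIDA_N, 0.56, OTHER_N_ASSUMED)
print(f"z = {z_stat:.2f}, two-sided p = {p_val:.2g}")
print("Significant at alpha = .001:", p_val < 0.001)
```

Under these assumptions the 24-point gap clears the .001 threshold, but the result depends on the assumed comparison-group size and on the choice of test, neither of which is confirmed by the source.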







Table 2: Percentage Agreement of Teachers in Florida and Other High-stakes 
States 46 







Other 


Survey Items 


Florida 


High- 

stakes 






States 


Views on Standards and the State Test

The state test is compatible with my daily instruction.             79*       64
The state test is based on a curriculum that all teachers in my
  state should follow.                                              71*       59
If I teach to the state standards or frameworks, students will do
  well on the test.                                                 57        54
The state test measures high standards of achievement.              54        48
The state testing program has brought much needed attention to
  education issues in my state.                                     58*       42

Impact on Classroom Practices

Greatly increased time spent on instruction in tested areas.        62*       42
Decreased time spent on instruction in non-tested areas.            67        58
Greatly decreased time spent on class enrichment activities.        23*       12
My tests are in the same format as the state-mandated test.         61*       50
I do not use computers when teaching writing because the state
  test is handwritten.                                              42*       32
The state testing program has led teachers to teach in ways that
  contradict their own ideas of good educational practice.          90*       75

Pressure on Teachers to Prepare Students for the State Test

Strongly agreed they felt pressure from their district
  superintendent to raise scores on the state test.                 80*       56
Strongly agreed they felt pressure from their building principal
  to raise scores on the state test.                                48        40
Strongly agreed that there is so much pressure for high scores on
  the state-mandated test that they have little time to teach
  anything not on the test.                                         63*       40
Spent more than 30 hours per year preparing students specifically
  for the state test.                                               54*       43
Prepared students for the state-mandated test throughout the year.  80        70
Used test preparation materials developed commercially or by the
  state.                                                            75*       63
Used released items from the state test to prepare students.        43        44
Taught test-taking skills to prepare students.                      85        85

Source: Pedulla et al., 2003.

* Percentage differences are significant at alpha = .001.
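
The asterisks in the table mark differences that are significant at alpha = .001. As an illustration only, one common way to test a difference between two percentages is a two-proportion z-test. The brief does not state here which test was used, and the group sizes in the sketch below are hypothetical placeholders rather than the survey's actual counts; only the two percentages (80 v. 56, the superintendent-pressure item) come from the table.

```python
import math

def two_proportion_z_test(p1, n1, p2, n2):
    """Return (z, two-sided p-value) for the difference between two proportions."""
    # Pooled proportion under the null hypothesis of no difference.
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / std_err
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value

# Percentages from the table (80 v. 56); group sizes are HYPOTHETICAL.
z, p = two_proportion_z_test(0.80, 150, 0.56, 2000)
print(f"z = {z:.2f}, significant at .001: {p < 0.001}")
```

With much smaller groups the same percentage gap could fail to reach significance, which is why the starred and unstarred rows cannot be judged from the size of the gap alone.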
Impact on Motivation and Morale 



With regard to the impact of the state test on teachers’ and students’ respective 
levels of motivation and morale, the survey results for teachers in Florida and in other 
states with a high-stakes testing policy are consistent with prior research in this area. An 
overwhelming percentage of teachers in both settings — roughly 95 percent — reported that 
their schools’ atmospheres were conducive to learning. A large majority of teachers in 
both settings also reported having high academic expectations for students (roughly 90 
percent) and that student morale was high in their school (approximately 70 percent). 

Yet fewer than half of the teachers in both Florida and other high-stakes states indicated 
that teacher morale was high (about 45 percent). Moreover, a greater percentage of 
Florida teachers, compared with those in other high-stakes states, reported that teachers in 
their schools wanted to transfer out of the grades in which the state test is administered 
(49 percent v. 38 percent). 

Florida teachers were also significantly more likely to report the perception that 
students were extremely anxious about the state test (92 percent v. 79 percent) and were 
under intense pressure to perform well (92 percent v. 79 percent). Even though a large 
majority of teachers in both Florida and other high-stakes states indicated that many of 
their students do try their best (approximately 90 percent), a smaller but sizable 
percentage, roughly 50 percent, reported that many students in their classes feel that, no 
matter how hard they try, they will still do poorly on the state test. Last, a significantly 
greater percentage of Florida teachers indicated that the testing program has led students 
to drop out of high school (38 percent v. 27 percent). A similar percentage of teachers in 
both groups, however, reported that the state test has contributed to an increase in grade 
retention, roughly 30 percent. In general, Florida teachers were significantly more likely 
to perceive that students experienced high degrees of both test-related anxiety and 
pressure, compared with teachers in other states using high-stakes exams. Respondents 
in Florida and other high-stakes states gave similar responses with regard to teacher and 
student morale, school atmosphere, and expectations held of students; however, teachers 
in Florida were more likely to report that teachers in their schools wanted to transfer out 
of the grades in which the state test was administered. 

Views on Accountability and the Meaning of Test Scores 

With the exception of their opinions on the use of test results to award school 
accreditation, the responses of Florida teachers were similar to those of practitioners in 
other high-stakes states. By and large, the majority of both groups of teachers regarded 
the use of test results for school and student accountability purposes as inappropriate. 

For example, a sizable percentage of teachers found using test results to award school 
accreditation inappropriate; Florida teachers were significantly more likely to hold this 
view (77 percent v. 65 percent). Roughly 60 percent of teachers in both groups reported 
that using scores on the state test to make decisions about grade promotion, or about 
retention, was inappropriate. Yet a majority of teachers in Florida and in other states 
using high-stakes testing (approximately 55 percent) considered it appropriate to use 
test results to award high school diplomas. 

Florida teachers and their counterparts in other high-stakes states held especially 
strong views about the capacity of the state test to serve both as an accurate measure of 
student achievement and as an indicator of school quality. Roughly 80 percent disagreed 
with the statement that the state test is as accurate a measure of student achievement as a 
teacher’s judgment. A similar percentage of teachers in both groups also disagreed that 
the scores on the state test accurately reflect the quality of education students have 
received. Moreover, Florida teachers were more likely than teachers in similar 
high-stakes settings to report that teachers in their schools had found ways to raise 
students’ scores on the state test without really improving learning (59 percent v. 39 
percent). This suggests that improvements on the state test may not show up on other 
tests that measure similar content and skills. 
Table 3: Percentage Agreement of Teachers in Florida and Other High-stakes States

Survey Items                                                Florida   Other High-stakes States

Impact on Motivation and Morale

My school has an atmosphere conducive to learning.          95        92
Teachers have high expectations for the academic
  performance of students in my school.                     92        91
Teachers in my school want to transfer out of the tested
  grades.                                                   49*       38
Teacher morale is high in my school.                        46        44
Strongly agreed that students are extremely anxious about
  taking the state test.                                    92*       79
Students are under intense pressure to perform well on the
  state test.                                               92*       79
The majority of students try their best on the state test.  91        83
Student morale is high in my school.                        71        64
Many students in my class feel that, no matter how hard
  they try, they will still do poorly on the state test.    44        52
The state test has caused many students in my district to
  drop out of high school.                                  38*       27
The state test results have led to grade retention in my
  district.                                                 30        27

Views on Accountability and the Meaning of Test Scores

Inappropriate to use test results to award school
  accreditation                                             77*       65
Inappropriate to use test results to promote or retain
  students in grade                                         62        59
Appropriate to use test results to award high school
  diplomas                                                  52        57
Teachers in my school have found ways to raise
  state-mandated test scores without really improving
  learning.                                                 59*       39
The state test is as accurate a measure of student
  achievement as a teacher’s judgment.                      14        19
Scores on the state test accurately reflect the quality of
  education students have received.                         13        20

Source: Pedulla et al., 2003.

* Percentage differences are significant at alpha = .001.
Quality of Available Data 



All measures that rely on self-reported data rest on the assumption that the 
participants respond truthfully and accurately; unfortunately, it is almost impossible to 
test this assumption. Efforts at the developmental stage, as well as statistical analyses, 
suggest that the survey instrument used in the national study and the results described 
herein are reliable and valid. In an effort to produce a high quality measure of teachers’ 
opinions, the survey was based in part on other surveys used to gather teachers’ opinions 
about the effects of state testing programs. 47 In addition, analyses revealed a high level 
of consistency among teachers’ responses to the survey items. This indicates that the 
measurement tool produced reliable results. 48 
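
The brief does not name the statistic behind this consistency check. As an illustration only, one widely used measure of internal consistency for survey items is Cronbach's alpha; the sketch below assumes that kind of analysis, and the response matrix in it is hypothetical, not survey data.

```python
from statistics import pvariance

def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(responses[0])              # number of survey items
    items = list(zip(*responses))      # one tuple of scores per item
    item_var_sum = sum(pvariance(item) for item in items)
    total_var = pvariance([sum(r) for r in responses])
    return (k / (k - 1)) * (1 - item_var_sum / total_var)

# HYPOTHETICAL responses: 4 teachers answering 3 items on a 1-4 scale.
sample = [[4, 4, 3], [2, 2, 2], [3, 4, 3], [1, 2, 1]]
print(round(cronbach_alpha(sample), 2))
```

Values near 1 indicate that respondents who score high on one item tend to score high on the others, which is the sense of "consistency" used in the paragraph above.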

Another consideration when assessing the quality of survey data is whether the 
respondents are actually representative of the population to which results are generalized. 
The teachers who completed the National Board survey were comparable to the national 
teaching force in terms of their ages, races/ethnicities, the types of schools in which they 
taught, and their years of teaching experience. 49 In addition, the responding group of 
Florida teachers was similar to the larger teaching population in the state with regard to 
sex, race/ethnicity, and type of school. 50 The available data present a national picture of 
how teachers are responding to different types of testing programs. Although the 
sampling design for the national survey was not intended for disaggregating and reporting 
results at the state level, the reasonably sufficient number of responding teachers from 
Florida provides for useful comparisons. These results, however, should be interpreted 
with some caution and viewed within the limitations of self-reported data. 
Section 4: Findings 



Florida teachers’ survey responses reflect mixed views on the state testing and 
accountability program. Generally, the most positive views were directed toward the 
state’s curriculum frameworks. For example, a large majority reported that the Sunshine 
State Standards should be followed, and that the state test was compatible with their daily 
instructional habits. About half regarded the state test as a measure of high standards of 
achievement. An alternative way of interpreting teachers’ responses to the questions 
about their views on standards is that they have accepted the content of the state test as 
reflecting that of the state standards, and agree that if they teach to those standards, their 
students will do well on the test. This is one view that may account for the divide 
between teachers’ more positive opinions toward the content standards and more negative 
views on the state test. However, research has consistently shown that teachers generally 
hold positive views toward content standards and find that they provide instructional 
focus, ensure greater homogeneity across classrooms with regard to instructional content, 
and facilitate collaboration with their colleagues. 51 

With regard to classroom practices, Florida teachers’ responses were consistent 
with prior research findings. The majority reported that they had reallocated instructional 
schedules, allowing for more time to be spent on tested content while decreasing the time 
ordinarily devoted to material that would not appear on the FCAT. In addition, a fourth 
of the respondents indicated that they had also decreased time spent on enrichment 
activities in order to prepare students for the state test. 

Teachers’ responses suggest that the content of the state test influences their 
instruction, as does the format of the FCAT: a sizable minority indicated that they do not 
use computers for writing instruction because students are required to provide 
handwritten responses to questions on the test. In addition, a majority of teachers also 
changed their assessment practices by modeling their own classroom tests after the 
format of the state test. Although a substantial percentage of teachers reported changes in 
their classroom practices, it was clear that they were not comfortable with making them: 
90 percent indicated that the state test required them to teach in ways contrary to their 
own ideas of good practice. 

Results also suggest that for many teachers, schools are highly stressed 
environments where a premium is placed on improving student test performance. Florida 
teachers held especially strong opinions about the pressure to raise test scores; 63 percent 
indicated the pressure was so great that they had little time to teach anything that would 
not appear on the test. Furthermore, a majority reported that they had found ways to raise 
test scores without improving learning. Taken together, these findings suggest that 
pressure to raise test scores has forced educators to fixate on short-term, immediate goals, 
perhaps at the expense of developing skills that encourage long-term, independent 
learning. 

Results also suggest that the state testing and accountability program in Florida 
may have potentially negative implications for the teaching profession. Less than half of 
the teachers surveyed reported that morale was high at their school; further, nearly half 
indicated that teachers wanted to transfer out of the grades in which the test was given. 
Since the National Board survey was conducted, the number of grades in which the 
FCAT is given has increased; now, all Grades 3-8 and 10 are tested in at least one subject 
of the FCAT. As a result, there are few classroom practitioners unaffected by the state 
test; instead of making changes to their teaching assignments within their respective 
schools or districts, they may choose to leave the profession altogether. 

The pressure to succeed on the state test did not rest solely with teachers; in 
Florida, students were also affected by the FCAT. More than 90 percent of respondents 
believed that students were under intense pressure and were highly anxious about the test. 
While moderate levels of test-related anxiety can actually improve motivation and test 
performance, an unmanageable amount can have an adverse effect. Students who view 
the test as an insurmountable barrier are also more likely to give up. A sizable 
percentage of teachers reported that many of their students have adopted the view that 
passing the state test may be a hopeless goal: 44 percent of teachers indicated that no 
matter how hard they tried, many students felt that they would still do poorly on the 
FCAT. The fact that almost 40 percent of teachers reported that the state testing program 
had contributed to students’ decisions to leave high school is disquieting. 

In order to meet high school diploma requirements, legislative provisions have 
allowed students to use scores on alternative assessments such as the Scholastic 
Assessment Test (SAT), the American College Test (ACT), and the College Placement 
Tests in lieu of 10th-grade FCAT scores. 54 Although this was an immediate solution to 
the potentially large numbers of Grade 12 students in Florida who failed to meet the 
FCAT requirements and who thus would not receive high school diplomas, it does not 
solve the problem in the long term. Only 6 out of 67 school districts had a majority of 
Grade 12 students pass the reading section of the 2003 FCAT; 10 districts boasted similar 
results on the math portion. 55 Tenth grade results from the previous year (2002) suggest 
that the picture is not as bleak, but it is still very troublesome: 73 percent passed the math 
portion and 59 percent passed the reading test. When the results are disaggregated by 
race, however, that rate of success is far lower for Black and Hispanic students than for 
their White or Asian counterparts. 56 These results suggest that for many students, 
especially for those of minority groups, failure on the FCAT is a real possibility, thus 
increasing the likelihood that they may leave the public school system altogether. 

Already the reality of decreased graduation rates and increased rates of grade 
retention is evident. Data (1984-2000 state enrollment statistics) reveal that Florida has a 
63 percent graduation rate, one of the lowest in the nation. This figure is found by 
comparing enrollments for Grade 8, four and a half years earlier, with the number of 
students who eventually graduated. Florida also has the highest Grade 9 to 10 attrition 
rates. One out of every four 9th-grade students in the 1999-2000 academic year did not 
progress to the 10th grade the following school year (2000-2001). 57 These statistics, 
coupled with the fact that roughly 27 percent of 10th-grade students did not pass the math 
portion of the 2002 FCAT and 41 percent did not pass the reading portion, are alarming 
and are indicative 
of the widespread human costs associated with high-stakes testing programs. 
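
The two rates cited in this paragraph follow from simple cohort arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the enrollment counts are invented solely to reproduce the 63 percent graduation rate and the one-in-four Grade 9 attrition figure described above.

```python
def graduation_rate(grade8_enrollment, graduates):
    """Percent of a Grade 8 cohort that graduated about 4.5 years later."""
    return 100 * graduates / grade8_enrollment

def attrition_rate(grade9_enrollment, grade10_next_year):
    """Percent of Grade 9 students not enrolled in Grade 10 the next year."""
    return 100 * (grade9_enrollment - grade10_next_year) / grade9_enrollment

# HYPOTHETICAL counts chosen only to reproduce the rates cited in the text.
print(graduation_rate(200_000, 126_000))   # 63.0
print(attrition_rate(240_000, 180_000))    # 25.0
```

Because the graduation figure compares two enrollment counts rather than tracking individual students, it is also affected by transfers into and out of the state's public schools.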

With regard to using test scores for accountability purposes, the majority of 
teachers did not support using test results to hold schools accountable, but their opinions 
about student accountability were mixed: a majority supported using test scores as a 
criterion for graduation. Teachers were less supportive of using results to determine 
grade promotion or retention, however. These findings may be explained, in part, by a 
belief on the part of teachers that accountability for student test performance should be 
shared among schools, teachers and students. Florida teachers held much more decisive 
views on the meaning of test scores. Overwhelmingly, educators did not consider 
students’ scores on the FCAT accurate indicators of either student achievement or school 
quality, and regarded their own professional judgments as more precise measures of 
student learning. 

Perhaps teachers recognize that like any measurement tool that produces a 
number — such as blood pressure gauges, complex laboratory tests, radar detectors, and 
breathalyzers — test scores are fallible. Most state laws, however, do not consider margin 
of error when interpreting a student’s scores. Misguided executive decisions, poorly 
conceived policies, understaffing, unrealistic deadlines, and unreasonable progress goals 
can cause numerous errors in test scores. The extent to which human errors in scoring, 
programming, and reporting are widespread has been thoroughly documented. Students 
have been wrongfully denied high school diplomas, mistakenly thought to have failed 
state exams, and erroneously required to attend summer schools as a result of test score 
errors. For example, in Florida, when the tests of 71 third-graders who had scored below 
the passing cut-off score were reviewed and scored by hand, the process unearthed a 
scoring machine error — leading a parent to comment, “Instantly, Raven was transformed 
from a 3rd-grade dufus to a state-certified 4th grader.” 59 Given these limitations and 
potential for serious harm, the American Educational Research Association has put forth 
a position statement that cautions policy makers about the use of high-stakes testing and 
advocates that test scores should not be used in isolation to make important decisions 
about “students’ life chances or educational opportunities.” 60 
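
To make the margin-of-error point concrete, the sketch below uses the standard error of measurement (SEM), a standard psychometric quantity equal to the score standard deviation times the square root of one minus the test's reliability. All numbers are hypothetical and are not FCAT values; the point is only that a student who scores slightly below a cut score may have a score band that includes the cut score.

```python
import math

def score_band(observed, sd, reliability, z=1.96):
    """Approximate 95 percent band around an observed score using the SEM."""
    sem = sd * math.sqrt(1 - reliability)  # standard error of measurement
    return observed - z * sem, observed + z * sem

def cut_within_band(observed, cut, sd, reliability):
    """True if the cut score falls inside the student's score band."""
    low, high = score_band(observed, sd, reliability)
    return low <= cut <= high

# HYPOTHETICAL values: a student who scored 5 points below the cut score.
low, high = score_band(observed=295, sd=50, reliability=0.91)
print(round(low, 1), round(high, 1))
print(cut_within_band(295, 300, 50, 0.91))
```

Under these assumed values the student's band extends well above the cut score, so a pass or fail decision based on the single observed score cannot be distinguished from measurement error.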

When the survey responses of Florida teachers were compared with those of 
teachers in other high-stakes states, clear differences emerged. Teachers in Florida were 
more likely to hold positive views on the state standards and on the compatibility of the 
testing program with their instruction than were teachers in other states using high-stakes 
tests. In addition, comparisons indicated that the state test had a greater impact on the 
classroom practices in Florida, as evidenced by the larger percentages of the state’s 
teachers who reported spending more time on tested content, and less time on enrichment 
activities; modeling the format of the state test in their own assessments; providing 
writing instruction in longhand rather than with computers (because computers were not 
permitted for writing on the state test); and teaching in ways that contradicted their own 
beliefs about good practice. 

Florida teachers were more likely than their counterparts in other high-stakes 
states to report feeling pressure to raise test scores. They also were more likely to teach to 
tested content and to spend instructional time on test preparation. Teachers in Florida 
were, in addition, more likely to perceive negative effects on students, compared with 
teachers in other high-stakes states, as they reported in greater percentages that their 
students experienced high levels of test-related anxiety and pressure. Further, a larger 
percentage of Florida teachers reported that the state test has led students to drop out of 
high school, compared with teachers in other states that rely on high-stakes exams. 
Although teachers in Florida held the state standards in higher regard than did their 
counterparts in other high-stakes states, they were also more likely to 
oppose the use of test results for school accountability purposes. However, the two 
groups held similar views regarding student accountability, with a majority in both 
groups supporting the use of test results to award high school diplomas. 

Overall, the survey results indicate that the state testing program in Florida is having a 
more pronounced effect, in both positive and negative ways, on classroom practice, on 
teachers, and on students than are the testing programs in other high-stakes states. In 
addition, according to Florida teachers, the extent to which 
FCAT scores are accurate measures of student achievement is questionable: the pressure 
to raise test scores may have distorted the educational process by requiring educators to 
over-emphasize test-specific preparation, and to implement what they view as unsound 
instructional practices. 

Section 5: Recommendations 

The importance of shared accountability in efforts to improve the quality of 
education and to enhance student learning is undeniable; the value of information 
imparted by measures of achievement is irrefutable. When teachers and students are the 
main parties on which rewards or sanctions are leveraged, however, research indicates 
that the educational process becomes distorted. In the attempt to improve indicators on 
which highly consequential decisions are based, efforts are directed toward the rapid 
raising of test scores, often at the expense of more effective educational practices. The 
current survey findings suggest that this is happening in Florida to a greater extent than in 
other states that have implemented high-stakes testing programs. These findings, 
considered together with the large body of research conducted on the effects of high- 
stakes testing on both teaching and learning, point to the following recommendations: 

1. Florida should undertake a long-term evaluation and monitoring program to 
assess the impact of the Florida Comprehensive Assessment Test (FCAT) and 
the A+ Accountability program. This evaluation and monitoring program 
should be conducted by an external organization or research institution. Its 
purpose is to determine if the state testing program is achieving its intended 
goals. The evaluation should also examine the unexpected consequences of 
the FCAT and A+ Accountability program on the educational process and on 
key stakeholders. 

2. Florida testing policy should adhere to the recognized professional standards 
regarding test development and to the appropriate use of test results as 
described in the Standards for Educational and Psychological Testing, 
published jointly by the American Educational Research Association, the 
American Psychological Association, and the National Council on 
Measurement in Education. 

3. Florida policy makers should not make highly consequential decisions about 
students (such as deciding whether a student is promoted to the next grade or 
is awarded a high school diploma) by means of test scores alone. Given the 
evidence pointing to weaknesses in the testing system, it is important to use 
other sources of information in conjunction with state tests. 